could be nice to have a pic at the start
knitr::include_graphics("map.png")
DC’s Capital Bikeshare service has been increasing in popularity, especially during the COVID-19 pandemic, when Washingtonians needed alternatives for public transport. To date it has 5,000 bikes and 600+ stations across 7 jurisdictions. However, users find the service unreliable at times, especially at peak times. In this project, our goal is to use and optimize supervised machine learning models that can predict the number of ride-sharing bikes that will be used at any give hour, using time and weather predictors. For simplicity, we apply our models to one station in particular that has particularly high demand: the Lincoln Memorial station. As such, our target variable will be the number of bikes that departed from that station at a given hour.
Being able to predict Capital Bikeshare demand, could result in a more efficient allocation of bikes when stations are re-stocked at night. It could also inform and it could inform the eventual expansion of stations across strategic locations across the city to improve the experience of Washingtonians.
We use Capital Bikeshare’s publicly available historic data, ranging from May 2020 until September 2021. After data cleaning, our dataset has 11,718 rows (CONFIRM?), each indicating the number of hours that departed Lincoln Memorial at a certain hour.
Duration – Duration of trip Start Date – Includes start date and time End Date – Includes end date and time Start Station – Includes starting station name and number End Station – Includes ending station name and number Bike Number – Includes ID number of bike used for the trip Member Type – Indicates whether user was a “registered” member (Annual Member, 30-Day Member or Day Key Member) or a “casual” rider (Single Trip, 24-Hour Pass, 3-Day Pass or 5-Day Pass)